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Abstract 

Our ability to control complex systems is a fundamental challenge of contemporary science. 
Recently introduced tools to identify the driver nodes, nodes through which we can achieve full 
control, predict the existence of multiple control configurations, prompting us to classify each node 
in a network based on their role in control. Accordingly a node is critical, intermittent or redundant 
if it acts as a driver node in all, some or none of the control configurations. Here we develop an 
analytical framework to identify the category of each node, leading to the discovery of two distinct 
control modes in complex systems: centralized vs distributed control. We predict the control mode 
for an arbitrary network and show that one can alter it through small structural perturbations. 
The uncovered bimodality has implications from network security to organizational research and 
offers new insights into the dynamics and control of complex systems. 
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A dynamical system is controllable if it can be driven from any initial state to any desired 
final state within finite time CHU- In general, controllability can be achieved by changing 
the state of a small set of driver nodes, which drive the dynamics of the whole network [5]. 
For a linear time-invariant dynamics we can identify the minimum driver node set (MDS), 
representing the smallest set of nodes through which we can yield control over the whole 
system While the number of driver nodes sufficient and necessary for control ( N d) 

is primarily fixed by the network’s degree distribution, there are multiple MDS’s with the 
same N d that can maintain control. For example, for the five-node network shown in Fig. 
0 Nd = 3, but the formalism indicates that control can be achieved via three different 
MDS’s: {1,3,4}, {1,3,5} and {1,4,5}. 

Here we explore the role of individual nodes in controlling a network by classifying each 
node into one of three categories based on its likelihood of being included in MDS m- 
critical, meaning that a node must always be controlled in order to control a system (it 
is part of all MDS’s); redundant, meaning that it is never required for control (does not 
participate in any MDS’s) and intermittent, meaning that it acts as driver node in some 
control configurations, but not in others. For example, in Fig. [l}i node 1 is critical, node 2 
is redundant and nodes 3, 4, 5 are intermittent (Fig. HI 3 )- This classification leads to the 
discovery of a bifurcation phenomenon, predicting that a bimodal behavior determines the 
controllability of many real networks. This bimodality helps us uncover two control modes, 
centralized vs distributed. We demonstrate both analytically and numerically the existence 
of these two modes and show that the predicted control modes naturally emerge in a wide 
range of real networks. 

Results 

Identifying node categories. We developed an algorithm to identify the redundant 
nodes in a network with N nodes and L links in O(NL) steps, offering the fraction n I of 
redundant nodes for an arbitrary network (see Methods). We also proved that a node is 
critical if and only if it has no incoming links (Supplementary Note 1), a theorem that 
provides the fraction of critical nodes in a network as n c = P; n (0) where P m {k ) is the 
incoming degree distribution. Finally intermittent nodes are neither critical nor redundant, 
hence their fraction is n\ = 1 — n c — n r . 

Bimodality in control. To explore the role of the network topology we measured n r and 
n c for networks with varying average degree ( k ). We find that for small ( k) for an ensemble of 
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networks with identical degree distribution P{k ), n r and n c follow narrow distributions (Fig. 
gi). This means that n T and n c are primarily determined by P(k ) [2T1 - 12%] . Surprisingly, 
when (k) exceeds a critical value k c [25], P{n r ) becomes bimodal, implying that systems 
with the same P(k) can exist in two distinct states: some have small n r and for others n r is 
very large. This bimodality is present in both random [26| and scale free networks IZZIEH] 
(Fig. g. Supplementary Note 2). The emergence of this bimodal behavior is best captured 
by plotting n r vs ( k ), observing a bifurcation as ( k) reaches k c (Fig. If). This bifurcation 
predicts two distinct control modes: 

Centralized control: For networks that follow the upper branch of the bifurcation diagram 
most nodes are redundant (n r ). This means that in these networks one can achieve control 
through a small fraction of all nodes ( n c + n j), hence capturing a centralized control mode 
(Fig. gi). This has obvious consequences in communication systems, as one can ensure 
a centralized system’s security by protecting only a small fraction of nodes ( n c + ni); in 
an organizational setting centralized network may be better suited for task execution, like 
manufacturing, where efficiency is enhanced by subordination to a small number of control 
nodes. 

Distributed control: For networks on the lower branch n c +rii can exceed 90% of the nodes. 
Hence most nodes can act as driver nodes in some MDS’s, resulting in a distributed con¬ 
trol mode (Fig. g). Securing such distributed communication networks requires significant 
resources, as one can gain control of the system via a large number of control configura¬ 
tions. Yet organizations displaying distributed control may be more capable of harboring 
innovation, as different node combinations could take control of the organization’s direction. 

Intuitively, one would expect these two different control modes to be associated with 
distinct network properties. However, we find that networks with identical in- and out-degree 
distribution can develop centralized or distributed control modes with equal probability (Fig. 
If). For networks with different in- and out-degree distributions the symmetry between the 
two branches is broken, forcing the network in one or the other branch of the bifurcation 
diagram. This is illustrated for networks with P(fc in ) ~ and P[k ont ) ~ k~^ t out where 
7in % 7out- The degree asymmetry forces a network to follow one or the other branch of the 
bifurcation diagram (Fig. g), pre-determining whether the network displays a centralized 
or a distributed control mode. 

Analytical approach. To understand the origin of the observed control modes, we map 
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the control problem to maximum matching 0129], which provides the fraction of redundant 
nodes in infinite networks as 

n r = 1 ~ Gm(0 ou t), (!) 

where Gj n (x) = YlkLo xk P'm{k) is the generating function for P- m (k ) [22] . 

Here 9 out is the solution of the recursive equation 

1 - #out = H ovA {l - -H’in(^out)), (2) 

where H oat> in (x) = J2kLi ^ fc-1 Qout,in(/c) and Q on t,m{k) = Pont,m(k) x k/(k) is the excess 
degree distribution. Equations (JTj) and (J2| self-consistently predict n r from the network’s 
degree distribution, in excellent agreement with the numerical results (see the continuous 
lines in Fig. §>, c), helping us understand the origin of the observed bifurcation. For P\ n (k) = 
Pout(k) equation (J2]) has a single solution for small (k) (top panel in Fig. [2]f). As (k) reaches 
k c , equation (J2]) develops three solutions, two of which are stable, corresponding to the two 
branches of the bifurcation diagram. As the new solutions are an analytic continuation 
of the (k) < k c solution, the system can continue in any of the two branches, hence the 
centralized or the distributed control modes are equiprobable. If, however, P in (k) ^ P 0 nt(k), 
for (k) > k c only one of the two stable solutions is an analytical continuation of the pre¬ 
bifurcation solution (middle panel in Fig. [2^). In other words infinite systems with different 
Pout(k) and P; n (/e) are destined for the centralized or distributed control mode, corresponding 
to the two branches of the bifurcation diagram, depending on the nature of their degree 
asymmetry. For small systems, the two modes can coexist and jumps between them are 
possible (Supplementary Figs. SI and S2). 

Bimodality in Real Networks. To demonstrate the empirical relevance of these tools, 
we used equations (JTJ) and (J2]) to calculate n r for several real networks, starting from their 
degree distribution. Note that while for some of these networks control is of potential 
relevance (like regulatory networks [30l [32] or neural networks [33]), others like citation net¬ 
works [m ITI] or the WWW pT71 ID] are of little or no relevance for control. We analyze 
them mainly because they offer diverse topologies that test the limits of our predictions. We 
obtain a reasonable agreement between the analytically predicted _/V r theory and N r obtained 
directly for each network (Fig. if, Supplementary Fig. S3). To compare n r for different 
networks, we plot them in function of 1 — no, representing the fraction of nodes that do 
not require external control. Obviously, real networks have widely different degree distri- 


5 


butions, degree asymmetries and even potential correlations [181 [401444] . Despite these the 
predicted bifurcation is observed in real systems as well (Fig. 2b, Supplementary Figure S4). 
We find that some network structures, like the E. coli metabolic network [45], are in the 
“pre-bifurcation” region ( k) < k c ; others, like the mobile call network [461450] and citation 
networks [341436] , follow one or the other branch of the bifurcation diagram, indicating that 
they are characterized by either centralized or distributed control. To identify the control 
mode charactering a particular network, we introduce the transpose network, whose wiring 
diagram is identical to the original network but the direction of each link is reversed. The 
control mode is captured by comparing n r of a network with nj of its transpose network: if 
A n T — n r — nj > 0 a network is centralized and if An r < 0 it is distributed (Supplementary 
Note 3). We find that a network’s degree distribution allows us to infer its control mode 
(Supplementary Note 4). 

Altering the Control Mode. The fact that the two control modes are better suited for 
different tasks raises an important question: can we turn a network initially in the centralized 
mode into a network in the distributed mode, or the other way around? Such a transition 
is achieved by the transpose network. Indeed, we can show that if the original network 
has a distributed control mode, its transpose will be centralized. Yet, in most real systems 
switching the direction of all links is either impossible or infeasible. We find, however, that 
in some networks the transition can be induced by local changes, in some cases requiring 
us to flip the direction of only a single well chosen link (Fig. [3}c) . The precise local change 
required to induce the transition is determined by the network’s size and degree asymmetry: 
the larger the network or the higher degree asymmetry, the more links need to be altered 
to induce this transition. For example, for two manufacturing and consulting networks [51] 
(N = 77, L = 2228 and N = 46, L = 858) the control mode can be altered by flipping only 
one link. In contrast for the prison inmate network [52, [53], food web in Little Rock lake 
[54] and the C. elegans neural network [33] we need to change the direction of several links 
(4, 17 and 54 respectively, representing 2%, 0.7% and 2% of all links) to alter the control 
mode. 

Discussion 

In summary, our attempts to quantify the role of individual nodes have led us to an unex¬ 
pected discovery: the existence of two distinct control modes, whose emergence is governed 
by a bifurcation phenomenon. The control mode is determined by the system’s degree asyrn- 
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metry and has immediate implications on network security or efficiency in organizational 
settings. Most importantly, bimodality is a general property of all dense networks, hence we 
must consider its implications each time we wish to control a complex system with sufficient 
connectivity. 

These results raise several intriguing questions. For example, the finding that the control 
mode can be altered via structural perturbations raises the need for tools to identify the 
minimum number of links whose reversal can help us reach the desired control mode [55] . 
Furthermore, the proposed node classification raises the possibility to correlate the role 
of a node in control with the intrinsic node attributes. Finally, our node classification 
is based on the node’s participation in various MDS’s. However, one can explore other 
node classifications as well, like that based on the energy needed for control, or the time 
necessary to reach the final state [9], important issues that need further investigation. While 
bimodality is ubiquitous in physics, chemistry and biology, the finding that it also plays a 
key role in network control opens new avenues to explore the control of real systems. 
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Methods 

Identifying the driver nodes. We convert a directed network into a bipartite graph 
with two disjoint sets of out and in nodes. A directed link from node i to j corresponds 
to a connection between node i in the out set and node j' in the in set (Supplementary 
Fig. S5). By finding the maximum matching of the bipartite graph, the minimum driver 
nodes are unmatched nodes in the in set. The critical, intermittent and redundant nodes are 
always matched, occasionally matched and never matched nodes in the in set, respectively 
(Supplementary Note 5). 

Identifying the redundant nodes. Redundant nodes are always matched in the bipar¬ 
tite graph. Therefore if we force them unmatched, there would be no alternative matching 
and the number of matched nodes will decrease, inspiring the algorithm we developed to 
identify them: 

(1) Find the maximum matching of the bipartite graph using the Hopcroft-Karp algorithm 
[29} EBHS2]. Obtain a set of matched nodes in the in set (denoted by M). (2) Pick one 
element (denoted by node i) in M. Identify the node in the out set that matches node i 
(denoted by node j). (3) Keep the current matching and temporarily remove node i with all 
its links. Check if there is an augmenting path [29, 58] that starts from node j, ends at an 
unmatched node and alternates between unmatched and matched links on the path. (4) If 
no augmenting path is found, node i needs to be always matched, therefore it is redundant. 
Otherwise node i is replaceable and hence it is intermittent. (5) Add back the removed node 
i and repeat step (2) until all nodes in set M is tested. 

The number of computational steps needed to find one maximum matching is O(N 0 5 L). 
Each matched node requires a breadth first search ( 0(L ) time) for the augmenting path. The 
number of matched nodes is proportional to N. Therefore the complexity of this algorithm 
is 0(NL). 

Calculating n r analytically. Denote with G a bipartite graph and G' = G\i the 
subgraph of G obtained by removing node i and all its links (Supplementary Fig. S6). We 
proved a theorem that node i is not always matched in G if and only if all its neighboring 
nodes are always matched in the subgraph G' (Supplementary Note 6). Based on this 
theorem, the probability that a node in the in set with degree k is not always matched is 
Pk, in = (0 O ut) fc where 6 0ut is the probability that a neighboring node is always matched in the 
out set of the subgraph G'. When averaging over degree distributions in infinite networks, 


the fraction of redundant nodes, which are always matched nodes in the in set, is 

OO 

n r 1 ^ ^ Pin{.k) X Pk i n 

k= 0 
oo 

= 1 - J]P in (A;)(fl out ) fe = 1 - G in (0 O ut), (3) 

k =0 

which gives equation (JT|). 

In the construction of subgraph G' , a neighbor of node i is accessed through a randomly 
chosen link. This implies that the degree distribution associated with the neighboring node 
is the excess degree distribution, i.e. the degree distribution for a node at the end of a 
randomly chosen link. Furthermore, in G' this randomly chosen link is excluded, meaning 
that the degree of the neighboring nodes will be less by 1. Therefore A out satisfies the 
equation 

OO 

1 - flout = ^ Qout(fc)(fl in ) fc - 1 = H out (l - fl in ). (4) 

k=1 

By substituting in for out , we can build a similar equation 

OO 

1 - = 5Z Qin(fc)(flout) fc ' 1 = tfin(Aout). (5) 

k =1 

Combining equations Q and ([5]) , we obtain equation ([2]). 

Networks analyzed. The model networks in this paper are Erdos-Renyi network [26] 
and scale-free network mm generated via the static model [60]. The real networks we 
explored are described in Supplementary Table SI and S2. 
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FIG. 1. (a) A network with five nodes that can be controlled via three driver nodes (Nd = 3). 

The system is characterized by three distinct minimum driver node sets (MDS’s). (b) Node 1 is 
critical as it is part of all MDS’s shown in (a), node 2 is redundant as it does not participate in 
any MDS and nodes 3, 4, 5 are intermittent, participating in some but not all MDS’s. 
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FIG. 2. Bifurcation in control, (a) Distribution of the fraction of redundant (n r ) and critical 
(n c ) nodes in scale-free networks with 7 ou t = 7 i n = 3, documenting the emergence of a bimodal 
behavior for high ( k ). (b) n r and n c (insert) vs (k ) in scale-free networks with degree exponents 
Tout = Tin = 3, illustrating the emergence of a bifurcation for high (k ). (c) n r in scale-free 

networks with asymmetric in- and out-degree distribution, i.e. Tout = 3, Tin = 2.67 (upper branch) 
and Tout = 2.67, Tin = 3 (lower branch). The control mode is pre-determined by their degree 
asymmetry. The solid lines in (b) and (c) correspond to the analytical prediction of equation 
([2]). The dashed line is the discontinuous solution of equation ([ 2 ]) when 7 out = 3 and Tin = 2.67, 
which shows a gap between the actual evolution of n r . (d,e) Networks displaying centralized and 
distributed control. For both networks Nd = 4 and N c = 1, yet they have rather different number 
of redundant nodes, N r = 23 in (d) and N r = 3 in (e). (f,g) The curves i7 out (l — H[ n (x)) + x — 1 
from equation ([ 2 ]) shown for different ( k ), Tout = Tin = 3 in (f) and Tout = 3, Tin = 2.67 in (g). The 
continuous stable solutions of equation ([ 2 ]) are jl^own as filled dots and the discontinuous solutions 
as empty circles. 
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FIG. 3. Bimodality in real networks, (a) The number of redundant nodes N r in real networks 
(Supplementary Tables SI and S2) and the theoretical prediction of equations ([!]) and ([2]). Hollow 
symbols represent networks characterized by centralized control (A n T > 0) and solid symbols 
correspond to the distributed control mode (An r < 0). For networks marked with a star symbol, 
A n r ~ 0 and the control mode can not be identified, (b) n r vs 1 — no for real networks. The 
networks follow the two branches corresponding to centralized and distributed control (highlighted 
bands), (c) A network in the centralized control mode, (d) After flipping the direction of one 
link in (c) (highlighted in both (c) and (d)), the control configuration changes from centralized to 
distributed mode. For both networks IVd = 5 and N c = 2, but in the centralized mode 8 nodes 
play a role in control (2 critical plus 6 intermittent) while in the distributed mode 43 nodes are 
engaged in control (2 critical plus 41 intermittent). 
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